Engineering posts about Data Warehousing
Curated summaries and key learnings for engineers working with Data Warehousing.
How World Bank Group uses Databricks to eradicate poverty through shared knowledge
The World Bank Group has developed a unified data and AI platform on Databricks to integrate structured operational data with unstructured documents, thereby eliminating manual research bottlenecks....
Unlock seamless and cost-effective marketing campaigns with Lakebase
The article discusses the implementation and benefits of Lakebase, an architecture that combines the advantages of transactional databases with the flexibility of data lakes. It highlights the...
Automate Data & KPI Monitoring with SQL Alerts
The article introduces Databricks SQL Alerts, a tool designed to automate data monitoring and KPI tracking within organizations. It highlights the challenges of manual monitoring processes and...
Scaling Airbnb’s identity graph with a unified knowledge graph infrastructure
The article outlines Airbnb's shift from a PaaS model to an internally managed knowledge graph infrastructure, focusing on the identity graph that captures user relationships. It details the...
Backstage with Lakebase, part 2
The second part of the series discusses the integration of Backstage with Databricks Lakebase, emphasizing the transformation of database management from a complex, multi-service...
Data quality is the AI strategy
The article emphasizes the critical role of data quality in leveraging AI effectively within healthcare systems. It highlights NYU Langone Health's strategic approach to data management, where the...
How CFOs in consulting can recover margin with Databricks
The article outlines the financial challenges faced by consulting firms, particularly in managing data across disparate systems, which leads to inefficiencies and margin pressures. It emphasizes the...
The Rise of Sports Intelligence: How the Lakehouse Turns Tracking Data into Competitive Advantage
The article explores the transformative impact of the Databricks Data Intelligence Platform on professional sports through the integration of vast amounts of tracking and biomechanical data. It...
Migrating Data Ingestion Systems at Meta Scale
The article outlines the comprehensive migration of Meta's data ingestion system, which was essential for maintaining the efficiency and reliability of their social graph data processing. It details...
Growth Analytics Is What Comes After Growth Hacking
The article explores the evolution of growth analytics as a critical component in modern user acquisition strategies. It highlights the shift from tactical growth hacking to a more analytical...
Why telecom churn prediction misses the intervention window
The article explores the challenges faced by telecommunications companies in effectively predicting and intervening in customer churn. Despite the sophistication of churn propensity models,...
Operating room utilization is hiding in your scheduling data
The article highlights the critical importance of operating room (OR) utilization in healthcare systems, emphasizing that underutilized ORs represent significant revenue losses and unmet patient...
Energy trading analytics in a real-time market
The article highlights the challenges faced in energy trading analytics due to the fast-paced nature of price changes and the limitations of traditional batch processing methods. It emphasizes the...
How nOps Rebuilt Their Cloud Optimization Platform on Databricks Lakebase, and Why Other ISVs Should Too
The article outlines how nOps transitioned their cloud optimization platform to utilize Databricks Lakebase, a fully managed PostgreSQL database integrated with the Databricks Lakehouse. This...
Top Data Warehouse Tools For Modern Data Analytics
The article discusses the critical role of selecting the right data warehouse tools for modern data analytics, emphasizing the shift towards lakehouse architectures that integrate the functionalities...
Data Science vs Data Engineering: Choosing Analysis or Infrastructure
The article delineates the roles of data engineers and data scientists, emphasizing their distinct responsibilities and the collaborative nature of their work. Data engineers focus on building and...
Driving Budapest Forward: How BKK Uses Databricks to Transform City Mobility
The article outlines how BKK, Budapest's unified transport authority, has leveraged Databricks to modernize its data management and analytics capabilities. Faced with challenges from a legacy...
Why Your OEE Dashboard Is Lying to You
The article highlights the discrepancies between KPI dashboards and the actual performance of manufacturing equipment, specifically focusing on Overall Equipment Effectiveness (OEE). It identifies...
Backstage with Lakebase
The article explores the convergence of operational and analytical databases through the integration of Backstage with Databricks Lakebase, emphasizing the shift from traditional ETL processes to a...
The marketing activation gap has a fix: Databricks and Stitch partner to turn data infrastructure into marketing performance
The article outlines the collaboration between Databricks and Stitch, aimed at enhancing the integration of data infrastructure with marketing operations. It highlights the challenges faced by...